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DMA SEQUENCE ENCODING 
SQUALENE SYNTHETASE 

Background of the Invention 

The present invention relates to genetic 
engineering, particularly relating to nucleic acid 
sequences encoding enzymes useful in isoprenoid 
biosynthesis; and specifically to DNA sequences 
5 encoding enzymes involved in catalyzing the 

biosynthesis of squalene. 

Squalene is a 30 -carbon triterpene composed 
of six isoprene units. Squalene is an intermediate in 
the biosynthesis of sterols, such as, cholesterol. 

10 See, Bradfute et al . , J. Biol. Chem. , Vol. 267, p. 

18308 (1992) and Ericsson et al . . J. Biol. Chem. . Vol. 
267, p. 18708 (1992) . Squalene synthetase, an enzyme 
that resides at a branch in sterol biosynthesis, 
catalyzes the formation of squalene via reductive 

15 dimerization of the 15 -carbon intermediate farnesyl 
diphosphate (FPP) . This reaction is the first 
committed step in sterol biosynthesis. Alternatively, 
FPP may serve as substrate for numerous other 
prenyltransf erases leading to the prenylation of 

20 proteins, or to the formation of non-sterol isoprenoids 
such as dolichols, ubiquinones, and carotenoids. See, 
Kuswik-Rabiega et al . . J. Biol . Chem . . Vol. 262, p. 
1505 (1987), Summers et al . . Gene , Vol. 136, p. 185 
(1993); Keller et al . . Arch. Biochem. Biophvs . > Vol . 

25 302, p. 304 (1993); LoGrasso et al . . Arch. Biochem . 
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Bi^opl^YS., Vol. 307, p. 193 (1993); McKenzie et al . . 
Bio, q^^m.. Vol. 267, p. 21368 (1992); European Patent 
Application No. 486,290 and G.B. Patent Application 
Nos. 2,249,099 and 2,272,442. 
5 Squalene synthetase has been studied in 

mammalian systems, and is considered a key enzyme in 
the regulation of cholesterol biosynthesis. Thus, 
squalene synthetase has been studied as a possible 
target for the design of th 
10 controlling serum cholesterol levels. See, Robinson et 
al . , Mol. Ce ll, Biol . . Vol.13, p, 2706 (1993). 
Inhibitors of the enzyme 3-hydroxy-3-methylglutaryl-CoA 
reductase (HMG-CoA reductase) have been shown to reduce 
serum cholesterol levels. See, Goldstein et al . . 
15 Nature, Vol. 343, p. 425 (1990) and Brown, et al . . 
Pj-ol. Cl>em,, Vol. 253, p. 1121 (1978). However, 
because HMG-CoA reductase mediates an early step in the 
isoprenoid pathway, the potential exists for the 
depletion of other key isoprenoids derived from 
20 mevalonic acid. Selective inhibition of squalene 
synthetase on the other hand, should not directly 
suppress biosynthesis of these non- sterol isoprene 
metabolites. Consequently, inhibitors of squalene 
synthetase may have advantages over inhibitors which 
25 act further upstream in the pathway. A thorough 

knowledge of the structure and activity of this enzyme 
would greatly facilitate the design of specific 
modulators of squalene synthetase . 

Squalene has been found to have useful 
30 applications as coatings, as set forth in U.S. Patent 
Nos. 5,198,254 to Nisperos-Carriedo et al. and 
5,284,508 to Shibata et al. . in the cosmetics industry, 
as is set forth in U.S. Patent Nos. 4,699,930 to Suga 
and 5,116,607 to Jones; and in the pharmaceutical 
35 industry, as is set forth in U.S. Patent No. 4,806,352 

to Cantrell and 5,260,067 to Zheng Xu. Squalene 
synthetase inhibitors are set forth in Dufresne et al . . 
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Tetrahedron . Vol. 48, p. 10221 (1992); Hensens s£. al . , 
Tetrahedron Let . . Vol. 34, p. 399 (1993) and U.S. 
Patent Nos. 5,250,424 to Bills et al . . 5,252,4 71 to 
Byrne et al . and 5,278,067 to Dawson et al , ■ 
5 It would be highly desirable to have the 

capability to alter those biosynthetic pathways which 
involve squalene, particularly in higher plants such as 
Solanaceae. As such, it would be desirable to provide 
nucleotide sequences that encode enzymes (e.g., 

10 squalene synthetase) , which are useful in the 

biosynthetic pathway of squalene, and various other 
isoprenoids including sterols. In particular, it would 
be desirable to provide the nucleotide sequences that 
encode squalene synthetase from a higher plant species, 

15 such as a Nicbtiana species. 

Summary of the Invent: j,p^ 
The present invention relates to nucleotide 
sequences (i.e., DNA and RNA) capable of encoding a 

20 polypeptide which has enzymatic activity for producing 
squalene. Such a polypeptide is referred to as 
squalene synthetase. The DNA is isolated from a plant 
(e.g., a higher plant), in particular from a Nicotiana 
species (e.g., Nicotiana benthamiana ) . The nucleotide 

25 sequence encodes squalene synthetase, and variants of 
that sequence encodes enzymes exhibiting the same 
biological activity as squalene synthetase. The 
nucleotide sequence corresponds to, or substantially 
to^ that sequence specified by SEQ ID NO: 1. The 

30 nucleotide sequence may be provided in an isolated, 
substantially pure form. 

In another aspect, the present invention 
relates to a polypeptide which may be provided in a 
purified (e.g., isolated and substantially pure) form. 

35 The polypeptide has enzymatic activity, for producing 
squalene. That polypeptide is encoded by the. 
nucleotide sequence which corresponds to, or 
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substantially to, that sequence specified by SEQ ID NO: 
1 . The polypeptide has an amino acid sequence which 
corresponds to, or substantially to, the amino acid 
sequence specified by SEQ ID NO: 2. Polypeptides 
5 having amino acid sequences which correspond 

substantially to that encoded by the specified nucleic 
acid sequence have different amino acid secjuences 
(e.g., a minor number of amino acids of the sequence 
can be deleted, added or substituted) , but the same 

10 type of biological activities as that encoded by the 

specified sequence, although those biological 
activities may differ in degree. 

In another aspect, the present invention 
relates to fragments of polypeptides (e.g., polypeptide 

15 derivatives) encoded by nucleotide sequences which 

correspond to, or substantially to, certain nucleotide 
subsequences contained within that sequence specified 
by SEQ. ID NO: 1. Such fragments represent domains of 
the full length (i.e., intact) polypeptides. Such 

20 fragments can be, for example, transit peptides useful 
for directing polypeptides to subcellular compartments, 
or polypeptide domains having properties such as 
catalytic activity, substrate binding activity, or the 
like. 

25 The nucleotide sequence of the present 

invention can be incorporated (i.e., in an operative 
fashion) into heterologous systems (e.g., yeast, 
bacteria or certain plants) in order that the 
respective polypeptide can be synthesized thereby. The 

30 nucleotide sequence can be incorporated into plants 
(e.g., rice, corn, tobacco or tomato) using 
transformation techniques or viral gene expression 
systems. The RNA molecules or polypeptides encoded by 
those nucleotide sequences can be used to alter the 

35 biosynthetic pathway of (i) sterols (e.g., 

cycloartenol , cholesterol, obtusif oliol, and related 
compounds), or (ii) other isoprenoids (e.g., squalene. 
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phytoene, lycopene, beta -carotene, and related 
compotinds) . For example, the nucleotide sequence can 
be incorporated into a heterologous system in order to 
effect the accumulation in that system of various 
5 isoprenoids, including squalene. Alternatively, the 
polypeptides can be isolated in order to be used as 
enzymes in the in vitro synthesis of sterols or 
isoprenoids . 

In another aspect, the present invention 

10 relates to recombinant DNA or RNA molecules. Such 
molecules include DNA sequences, corresponding RNA 
seciuences, or subsequences of such DNA and RNA 
sequences. Certain sequences and subsequences (e.g., 
promoters, enhancers, terminators and replication 

15 signals) are capable of facilitating the expression of 
squalene synthetase. A DNA sequence or at least one 
subsequence has a nucleotide sequence which is 
specified, or substantially specified, by at least a 
portion of that nucleotide sequence specified by SEQ ID 

20 NO: 1. The DNA sequence is isolated from a Nicotiana 
species, such as Nicotiana benthamiana, 

The recombinant molecule can be considered a 
plasmid or a vector. The recombinant molecule can be a 
plasmid or vector tailored for transfer of the 

25 recombinant molecule to plant or other cells . 

Recombinant molecules also can be contained in a 
transgenic plant cell, such as a tobacco plant cell. 

In another aspect, the invention relates to 
the delivery and expression of sequences or 

30 subsequences as described by SEQ ID NO: 1 via transient 

virus-based gene delivery systems. Such a system is 
described in European Patent Application No. 406,267 
and U.S. Patent No. 5,316,931, which are incorporated 
herein by reference. Such a system involves delivery 

35 of a selected nucleotide sequence as part of, or in 
conjunction with, a self -replicating DNA or RNA 
molecule (i.e., a virus), such that the exogenous gene 
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is replicated and expressed during the course of 
replication and expression o£ viral or virus -based 
nucleic acids and proteins. Such gene delivery systems 
may be used for expression of nucleic acid sequences 
5 or subsequences as described by SEQ ID NO: 1 in either 
sense orientation for the expression of polypeptides, 
or in antisense orientation for the delivery of RNA 
molecules capable of inhibiting expression of the 
target gene or other homologous genes. Genes or gene 

10 sequences delivered in such a manner are considered to 
be functionally inserted in the target organism such as 
a tobacco plant . 

A further aspect of the present invention is 
a crop composed of a plurality of plants having the 

15 nucleotide sequence of the present invention, or a 

subsecjuence thereof, functionally inserted therein; and 
planted together in an agricultural field, including a 
greenhouse or growth chamber. 

Compositions including genetic sequences and 

20 siibsequences encoding enzymes for expression in plants, 
such as tobacco plants, impart those plants with the 
ability to produce altered levels of sterols and/or 
isoprenoids. As such, there is provided a method for 
altering the synthesis of sterols and isoprenoids in 

25 plants, which method involves inserting a recombinant 

genetic construct into plant cells. Such a construct 
can provide for synthesis of naturally occurring 
sterols (e.g., cycloartenol , obtusifoliol or 
cholesterol) and other isoprenoids (e.g., squalene, 

30 phytoene or beta- carotene) within such plant cells. 

For example, certain recombinant genetic constructs of 
the present invention are capable of increasing or 
decreasing expression of at least one naturally 
occurring enzyme in order that resulting transformed 

35 plants exhibit enhanced ability to produce a sterol or 
a carotenoid. As such, there is provided a method for 
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altering the biosynthetic pathway of various components 
in plants. 

In yet another aspect, the present invention 
relates to an antisense sequence for that sequence 
5 which corresponds to, or substantially to, the totality 
or a subset of that nucleic acid sequence specified by 
SEQ ID NO: !• As such, that sequence encodes RNA 
molecules capable of inhibiting expression of the 
corresponding gene and genes related thereto. Antisense 

10 RNA is capable of reducing the expression of genes 

within plant cells. See, Ecker et al . . Proc . Nat . 
Acad. Sci. USA , Vol. 83, p. 5372, (1986) and Rothstein 
et al . . Proc . Nat . Acad . Sci . USA . Vol. 84, p. 8439 
(1987) . The nucleotide sequence can be incorporated 

15 into plants (e.g., tobacco or tomato) using 

transformation techniques or viral gene expression 
systems. As a result, the present invention provides 
for a method for altering the synthetic pathway of 
sterols and isoprenoids. For example, the expression 

20 of an antisense molecule can be useful for suppressing 
or preventing synthesis of sterols (e.g., to cause a 
flux of total carbon available for biosynthesis of 
carotenoid compounds) or for preventing synthesis of 
certain isoprenoid compounds. Alternatively, 

25 expression of an antisense molecule may be useful in 
accumulating relatively high levels of certain 
molecules upstream of a particular metabolic block. 
Another example is the use of such antisense molecules 
to deliberately direct metaJDolites toward one branch of 

30 a branched pathway, such as that pathway involved in 
carotenoid biosynthesis. 

Brief Description of the Drawings 
Figures 1 and 2 illustrate maps of plasmids 
35 pBGC802 and pBGC803, respectively, which each contain 
the Nicotictna. benthamianst scjualene synthetase gene 
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cloned in as a PGR fragment into plasmid pCRII 
(Invitrogen, San Diego, CA) . 

Figure 3 illustrates a map of the plasmid 
PBGC804 containing the Nicotlana Jbenthamiana squalene 
5 synthetase gene cloned as an Ndel/Sall fragment into 
the expression vector pT7-7, 

Figure 4 illustrates a map of the 
transcription plasmid pBGCBOS containing a 63 8 bp 
antisense fragment of Nicotians benthamiana squalene 
10 synthetase cloned into the Xhol and Avrll sites of the 
RNA viral vector TTOIA, 

Detailed Description of the Preferred Embodiments 

The nucleotide sequences of genetic materials 

15 of the present invention are isolated from a plant, and 
preferably from a Nicotiana species. A plant is an 
organism belonging to the kingdom Plantae. See, Raven, 
£t_alj-/ p. 178 (1981). Most preferably, the nucleotide 
sequences are isolated from Nicotiana benthamiana . 

20 Examples of other Nicotiana species include Nicotiana 
debneyi , Nicotiana alauca. Nicotiana alutinoga . 
Nicotiana rustica. Nicotiana svaveolens and Nicotiana 
t^bacum. Examples of cultivars of Nicotiana t aba cum 
include flue-cured tobacco (e.g., NK 326), Burley 

25 tobacco (e.g., KY 14) and Maryland tobacco (e.g., MD 

609) . 

The nucleotide sequence of isolated genetic 
material of the present invention can be obtained by a 
variety of techniques. The sequence can be obtained by 

30 sequencing non-vector nucleotide sequences of 

recombinant molecules. Nucleotide sequence information 
can be obtained by employing widely used DNA sequencing 
protocols. Examples of suitable nucleotide sequencing 
protocols can be found in Berger and Kimmel, Methods in 

35 Enzvmolocrv Vol. 51, Guide to Molec ular Cloning 

Techniques . Academic Press (1987) . Nucleotide 
sequence information from several recombinant DNA 
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isolates, including isolates from both cDNA and genomic 
libraries, can be combined so as to provide the entire 
amino acid coding secfuence, as well as the nucleotide 
sequences of upstream and downstream nucleotide 
5 sequences . 

For gene isolation, mRNA is converted into 
gDNA, and the resulting cDNA is cloned. The cloned 
cDNA then can be used directly, or it, or sequences 
derived from it, can be utilized for acquiring the 

10 entire gene, either (i) from a library (e.g., in a 

lambda or plasmid vector) using sequence information to 
screen the library and detect the desired clone, or 
(ii) by amplification with PCR (polymerase chain 
reaction) and subsequent cloning into a suitable 

15 vector. For example, the 5' and 3' RACE (Rapid 

Amplification of cDNA Ends) reactions can be used to 
clone overlapping 5' and 3' ends of the gene of 
interest with subsequent assembly of the complete gene. 

Nucleotide sequences obtained from sequencing 

20 specific genetic library isolates can be subjected to 
further analysis in order to identify regions of 
interest in the genetic material. These regions of 
interest include additional open reading frames, 
promoter sequences, termination sequences, and the 

25 like. 

Isolated DNA can be characterized as being 
selected from the group consisting of: 

(a) Isolated DNA consisting of DNA 
having the nucleotide sequence 

30 which corresponds to, or 

substantially to, SEQ ID NO: 1. 

(b) Isolated DNA which hybridizes to isolated 
DNA of (a) above which encodes an enzyme or fragment 
thereof having squalene biosynthetic activity. 

35 Hybridization of such sequences may, for example, be 
carried out under stringent conditions (e.g., 
conditions represented by a wash stringency of 0 . Ix SSC 
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(ix SSC is 0.3M NaCl, 0.03M sodium citrate), 0.1% SDS 
at SO^C to DNA of (a) above) in a standard in situ 
hybridization assay. See J. Sambrook £t ai.. Molecular 
Cloning: A Laboratory M anual (2d Ed.) , Cold Spring 
5 Harbor Laboratory, Cold Spring Harbor, N.Y. (1989) . In 
general, such sequences will be at least 95% 
homologous, often at least 98% homologous, and even at 
least 99% homologous with the sequences of (a) above. 

' ^ ' — - -•w.k.w^ WMW WW ^ OW^O ^ ClhO. kJlH^ 

10 of (a) and (b) above. Homology relates to substantial 
or complete identity of nucleic acid sequences. Two 
nucleic acid fragments are homologous if they are 
capable of hybridizing to one another under 
hybridization conditions described in Maniatis et al . , 
15 Molecular Cloning: A Labor atory Manual . Cold Spring 

Harbor Laboratory, Cold Spring Harbor, N.Y., pp. 324- 
325 (1982) . Homologous sequences can be identified 
that contain less than about 5% base pair mismatches by 
using the following wash conditions: 2 x SSC, 0.1% SDS, 
20 room temperature twice, 30 minutes each; then 2 x SSC, 
room temperature twice, 10 minutes each. Homology can 
be assessed using computer programs such as DNASIS"" and 
PCGene (LKB, Hitachi Corporation, Japan, and 
Intelligenetics, Palo Alto, CA.) whereby the degree of 
25 homology is within the limits of homology considered 
significant by Bost et al . . Biochem. Bioohvs. Res, 
Commun . , Vol. 128, pp. 1373-1380 (1985). More 
preferably, homologous nucleic acid strands contain 
less than 2% base pair mismatches, even more preferably 
30 less than 1% base pair mismatches. These degrees of 
homology can be selected by using more stringent wash 
conditions for identification of clones from gene 
libraries (or other sources of genetic material), as is 
well known in the art . 
35 (d) Isolated DNA differing from the isolated 

DNA of (a) , (b) and (c) above in nucleotide sequence 
due to the degeneracy of the genetic code, and which 
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encodes either (i) a polypeptide identical to the 
polypeptide specified by SEQ ID NO: 2, or (ii) an 
enzyme or fragment thereof having sterol or isoprenoid 
biosynthetic activity. 
5 There is about 43 to about 49 percent 

conservation between the nucleotide coding region 
specified by SEQ ID NO: 1 and those sgualene synthetase 
genes set forth in Jennings, et al . . Proc . Nat . Acad . 
Sci. USA, Vol. 88, p. 6038 (1991), McKenzie, et al . , J. 

10 Biol. Chem.. Vol. 267, p. 21368 (1992) and Robinson, £t 

al., Mol. Cell. Biol . . Vol. 13, p. 2706 (1993); as 
determined by PC/GENE Multiple Sequence Alignment 
program (IntelliGenetics, Inc., Mountain View, CA) . 
With regards to the polypeptide encoded by the nucleic 

15 acid sequence specified by SEQ ID NO: 1, there is a 
21.8 percent identity and 24.6 percent similarity 
between the amino acid sequence specified by SEQ ID NO: 
2 and the squalene synthetase sequences set forth in 
Jennings, et al. , Proc. Nat. Acad. Sci. USA , Vol. 88, 

20 p. 6038 (1991), McKenzie, et al . . J. Biol. Chem . . Vol. 
267, p. 21368 (1992) and Robinson, et al . . Mol. Cell. 
Biol . . Vol. 13, p. 2706 (1993). However, there is 90 
percent homology between the conserved residues found 
in known squalene synthetase amino acid sequences and 

25 those regions of homology found in the amino acid 

secjuence specified by SEQ ID NO: 2. For the sequence 
specified by SEQ ID NO: 2, the conserved regions of 
amino acids are in positons 1, 40, 44, 48-52, 54, 55, 
58, 63, 66, 69-71, 73, 74, 77, 78, 80-82, 90, 94, 97, 

30 106, 108, 116, 122, 132, 136, 140, 147, 148, 150-152, 

168-177, 179, 180, 200-214, 216, 217, 225, 227-229, 
231, 233, 236, 248, 255, 262-264, 266, 267, 269, 271- 
273, 283, 285-294, 296-298, 303, 306, 311, 312, 314, 
316, 350, 351. 

35 The nucleotide sequences of the present 

invention are capable of encoding polypeptides having 
enzymatic activity for sterol biosynthesis and 
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isoprenoid biosynthesis. Such polypeptides have 
domains of amino acid sequences having catalytic 
activity. The catalytic domains demonstrate enzymatic 
activity, can be isolated, and can be expressed in in 
5 vivo or in vitro systems. 

A polypeptide derivative of a particular 
biosynthetic enzyme can differ in length from the 
natural enzyme, but typically contains numerous amino 
acids from the natural enzyme in the same primary order 

10 as found in that enzyme as obtained from a natural 

source. Such a polypeptide molecule* has substantially 
the same full length amino acid sequence as the natural 
enzyme but possesses minor amino acid substitutions 
that do not substantially affect the ability of that 

15 derivative to cause biosynthesis of sterols or 

isoprenoids. Derivatives include glycosylated forms, 
aggregative conjugates with other enzyme molecules and 
covalent conjugates with unrelated chemical moieties. 
Covalent derivatives are prepared by linkage of 

20 functionalities to groups which are found in the enzyme 
amino acid chain or at the N- or C- terminal residue by 
means known in the art . 

Isolated nucleotide sequences encoding 
biosynthetic enzyme can be used to produce purified 

25 enzyme or derivatives thereof by either recombinant DNA 
methodology or by in vitro polypeptide synthesis 
techniques. Purified and isolated polypeptide or 
nucleotide sequences are present in the substantial 
absence of other biological macromolecules of the same 

30 type. Purified genes and polypeptides of the present 
invention typically have at least 95% by weight, more 
preferably at least 99% by weight, and most preferably 
at least 99.8% by weight, of biological macromolecules 
of the same type present; but water, buffers, and other 

35 small molecules, especially molecules having a 

molecular weight of less than 1000, can be present. 
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Biosynthetic enzymes and polypeptide 
derivatives of those enzymes can be expressed by 
recombinant techniques when a DNA sequence encoding the 
relevant molecule is functionally inserted into a 
5 vector (e.g., in proper reading frame and orientation, 
as is well understood by those skilled in the art) . 
Typically, the relevant gene will be inserted 
downstream from a promoter and will be followed by a 
stop codon, although production as a hybrid protein 

10 followed by cleavage may be used, if desired. In 

general, host -cell -specific sequences improving the 
production yield of enzyme and enzyme derivatives can 
be used, and appropriate control sequences (e.g., 
enhancer sequences, polyadenylation sequences, and * 

15 ribosome binding sites) can be added to the expression 
vector . 

A genetic construct can be prepared and used 
to transform plant cells. The transformed plant cells 
may be cells in culture, may be present as a 

20 disorganized mass in callus, leaf explants, or shoot 
cultures, or may be a post -transformation 
differentiated plant or plant part, such as seeds, 
leaves, roots, or the like. The foreign construct 
normally is present in all or substantially all of the 

25 cells of the plant tissue, but expression may be 

limited to particular cells or particular times in the 
development of the plant. The foreign construct 
normally includes transcriptional and translational 
initiation and termination signals, with the initiation 

3 0 signals 5' to the gene of interest and the termination 

signals 3' to the gene of interest. 

The transcriptional initiation region which 
includes an RNA polymerase binding site (i.e., 
promoter) may be native to the host or may be derived 

35 from an alternative source, where the region, is 
functional in the plant host. 
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The transcriptional initiation regions may 
include, in addition to the RNA polymerase binding 
site, regions providing for regulation of 
transcription. The 3' termination region may be 
5 derived from the same gene as the transcriptional 
initiation region or from a different gene. For 
example, where the gene of interest has a 
transcriptional termination region functional in the 
host species, that region may be retained with the 
10 gene. 

An exemplary expression cassette can be 
constructed to include a) a transcriptional initiation 
region, b) the biosynthetic enzyme gene under the 
transcriptional regulational control of the 
15 transcriptional initiation region, c) the translational 

initiation codon, d) the coding sequence of the gene, 
with or without introns, and e) the translational stop 
codons, followed by f) the transcriptional termination 
legion. The transcriptional termination region 
20 includes the terminator, and may include a 

polyadenylation signal sequence, and other sequences 
associated with transcriptional termination. The 
direction is 5' -3' in the direction of transcription. 

When the expression product of the gene is to 
25 be located in a subcellular or extracellular 

compartment other than the cytoplasm, the gene usually 
is constructed to include particular amino acid 
sequences which result in translocation of the product 
to a particular site. That site may be an organelle, 
30 such as the chloroplast, mitochondrion, endoplasmic 

reticulum or nucleus, the cell plasma membrane, or may 
be secreted into the periplasmic space or into the 
external environment of the cell. Various secretory 
leaders, membrane integrator sequences, and 
35 translocation sequences for directing the peptide 

expression product to a particular site are described 
in the literature. See, for example, Cashmore et al . , 
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Biotechnolcxry, Vol. 3, pp. 803-808 (1985), and Wickner 
and liOdish, Science . Vol. 230, pp. 400-407 (1985) . 

The expression cassette normally is carried 
on a vector having at least one replication system. 
5 For convenience, it is common to have a replication 

system function in E. coli such as ColEl, pSClOl, pA- 
CYC184, or the like. In this manner, at each stage 
after each manipulation, the resulting construct may be 
cloned, sequenced, and the correctness of the 
10 manipulation determined. 

In addition to the replication system, there 
frequently is at least one marker present, which may be 
useful in one or more hosts, or different markers for 
individual hosts. That is, one marker may be employed 
15 for selection in a prokaryotic host, while another 

marker may be employed for selection in a eukaryotic 
host, particularly the plant species host. The markers 
may be protection against a biocide, such as 
antibiotics, toxins, heavy metals, or the like; or 
20 complementation, imparting prototropy to an auxotrophic 
host . 

The various fragments comprising the various 
constructs, expression cassettes, markers, and the like 
may be introduced consecutively by restriction enzyme 

25 cleavage of an appropriate replication system, and 

insertion of the particular construct or fragment into 
the available sites. After ligation and cloning, the 
vector may be isolated for further manipulation. All 
of these techniques are amply exemplified in the 

30 literature and find particular exemplification in 
Sambrook e£. Molecular Cloning : A Laboratory 

Manual . Cold Spring Harbor Laboratory, Cold Spring 
Harbor, N.Y., (1982). 

Once the vector is completed, the vector may 

35 be introduced into plant cells. Techniques for 
transforming plant cells include microinjection, 
particle bombardment, direct DNA update, such as using 
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polyethylene glycol, electroporation, viral infection, 
and transfozrmation with Aarobact er ium . For 
representative techniques, see, D'Halluin et al , , The 
Plant Cell, Vol. 4, p. 1495 (1992); Tomes et al . , Plant 
5 Mol , Bipl , , Vol. 14, p. 261 (1990); and Zhu et al . , 

Plant Cell. Tissue and Organ CultuT-f>. Vol. 22, p. 145 
(1991) . 

The nucleotide sequence encoding a given 
enzyme can be functionally inserted into plants or 

10 transiently expressed by virus -based gene delivery 

systems. Such a system is described in European Patent 
Application Nos. 67,553, 194,809 and 406,267, PCT WO 
93/20217 and U.S. Patent Nos. 5,304,731 and 5,316,931; 
which are incorporated herein by reference. Such a 

15 system involves delivery of a selected nucleotide 

sequence as part of, or in conjunction with, a self- 
replicating DNA or RNA molecule (e.g., a virus), such 
that the exogenous gene is replicated and expressed 
during the course of replication and expression of 

20 viral or virus-based nucleic acids and proteins. Such 
gene delivery systems, in addition to enhancing gene 
"copy number" through the replicative potential of the 
given virus or virus-based nucleic acids, facilitate 
the timed delivery of such exogenous genes at the 

25 desired state of host development. 

An exemplary virus -based gene delivery system 
employs tobacco mosaic virus . A DNA copy of the virus 
has a DNA sequence of the present invention inserted 
therein using conventional techniques of molecular 

30 cloning. The cloned DNA copy of the resulting viral 
vector then is transcribed to produce an RNA. The 
resulting RNA transcript vector then can be used to 
inoculate a grown Nicotiana benthamiana or Nicotiana 
tobacum plant by applying that RNA transcript onto a 

35 carborundum- dusted leaf in the direction of the leaf 

tip with a gloved finger. The plant so inoculated and 
infected with the virus vector is allowed to continue 
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growing for about 3 weeks and then is harvested. The 
harvested plant can be extracted immediately or frozen 
for storage purposes. This infected plant material, or 
the vector virions isolated from it, can then be used 
5 for the inoculation of a multiplicity of plants for 

purposes of significantly amplifying the delivered gene 
or gene product. In order to accomplish this, the leaf 
tiasue so collected is mixed with a lOmM phosphate 
buffer (pH 7.5) at a ratio of about 1:10 (w/v) . The 

10 mixture is macerated using a high speed blender, and 
centrifuged so as to obtain a liquid inoculum. The 
inoculum then can be applied to a growing plant, such 
as a tobacco plant, by applying the inoculum to injured 
(e.g., lacerated) regions of that plant. For example, 

15 the upper most leaves of the growing plant can be cut 
using an inoculum-wetted cutting blade (e.g., by spray 
nozzles located in the cutting blade) ; or the inoculum 
can be sprayed onto a specific region of one leaf of 
the plant as a high pressure spray. The plant is 

20 allowed to continue growing in order that the viral 

vector can systemically infect the plant. At a desired 
stage of infection, the plant is harvested. 

The following examples are provided in order 
to further illustrate the invention but should not be 

25 construed as limiting the scope thereof. Unless 
otherwise noted, all parts and percentages are by 
weight . 

Example 1 

30 Isolation of a gene encoding sgualene 

synthetase was carried out as follows: 

A. Isolation of RNA from Nicotiana 
benthamiana : 

About 100 mg of plant leaf tissue was 
35 harvested, transfered to a 1.5 ml microfuge tube, 

frozen with liquid nitrogen, and homogenized in one ml 
of AGRIsol (Biogentex, Houston, TX) using a teflon 
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pestle connected to a Moto Tool™* (Dremel Corp. , 
Racine, WI) , Immediately following homogenization, 100 
All of chloroform was introduced into the tube. The 
mixture was shaken by hand for 20 sec. Then, the 
5 mixture was inciibated for 5 min. on ice, and 

centrifuged for 10 min. at 10,000 x g at 4*C. The 
aqueous phase of the centrifuged mixture was 
transferred to a separate tube, and the RNA was 

v^^A^ A A ^ ^ ^ ^^^wt V-» -.-^ V» -« j-» /-<« W« r ^ -Z *^ -Z ^ £ ^ ..^ 1 

WV* JL. JL-^Wttl CAAGi.^ ^JIXGIO^ C&l-IUL ^ W ^ A \J X. KJLIXZ 

10 of isopropanol. After incubation on ice for 5 min. 

that sample was centrifuged for 10 min. at 10,000 x g 
at 4'C. The resulting pellet was collected and washed 
twice with ethanol (75% in water) . The pellet was 
air-dried for about 15 min. The pellet was resuspended 

15 in 20 ^1 RNase*free dHaO. The RNA concentration was 
calculated by determining the ODjco nm (1 ODjgo nra = 40 
ptg/ml) . 

B- Isolation of an internal cDNA fragment of 
sgualene synthetase . 

20 The literature was examined for nucleotide or 

amino acid sequences from human and yeast encoding the 
gene for squalene synthetase. Regions of conserved 
amino acid sequence were used to design suitable 
degenerate oligonucleotide primers. See Robinson et 

25 al , , Molecular and Cellular Biolocrv . Vol. 13, p. 2706 

(May 1993) . Primers were synthesized from these 
sequences on an Applied Biosystems Model 3 91 PCR Mate, 
or were obtained from commercial sources. 

Approximately 150 ng of total RNA was mixed 

30 with 10 pg of primer SSP-1, (5'-GGC TCG AGA TIG CCA TIA 
CYT GNG GNA TNG CXC AXA A-3' , where X is G or A, Y is C 
or T, N is G, A, C, or T, and I is inosine) and dried 
in a lyophilizer. Five microliters of a 2.5 mM dNTP 
mix (dATP, dCTP, dGTP, and dTTP) was added to the dried 

35 RNA-primer pellet, heated 5 min at 70**C, and cooled to 
37®C. The reaction mixture was made up to 10 fil with 
final concentrations of 50 mM Tris-HCl, pH 8.3, 50 mM 
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KCl, 5 mM MgCl2, 0*5 mM spermidine, and 10 tnM DTT. 
Five units of avian myeloblastosis virus <AMV) reverse 
transcriptase (Promega, Madison, WI) were added and the 
reaction was incubated for 2 hr at 37®C. 
5 Following first strand cDNA synthesis, 10 pg 

of primer SSP-2, (5'-GGC TCG AGT AYT GYC AYT AYG TIG 
CNG GIC TNG TNG G-3') , where X is G or A, Y is C or T, 
N is G, A, C, or T, and I is inosine) , was added along 
with 5 fil of GeneAmp® lOX PGR buffer (500 mM KCl , 100 

10 mM Tris, pH 8.3, 15 mM MgCljr and 0.01% (w/v) gelatin) 

(Perkin-Elmer Cetus, Norwalk, CT) , 30 /xl distilled, 
deionized water, and 5 units AmpliTaq® DNA polymerase 
(Perkin-Elmer Cetus) . PGR amplification was performed 
using a DNA Thermal Cycler (Perkin-Elmer Cetus) with 1 ^ 

15 cycle at 95^C for 5 min, 35 cycles of 50®C for 2 min, 

72^C for 2 min, and 95®C for 2 min, and 1 cycle of 72®C 
for 9 min. The amplified products were re*amplified in 
a 50 ^1 reaction volume with final concentrations of 50 
mM KCl, 20 mM Tris-HCl, pH 8.3, 1.5 mM MgClj, 0.01% 

20 (w/v) gelatin, 0.5 mM each of dATP, dGTP, dCTP, dTTP, 

and 5 units AmpliTaq® DNA polymerase. The re-amplified 
. products were analyzed by separation on a 1% SeaKem® 
agarose (FMC BioProducts, Rockland, ME) gel in TAE (Ix 
TAE Buffer is 40 mM Tris-Acetate, 2 mM EDTA) by gel 

25 electrophoresis, and a band corresponding to the 
predicted size of 4 00 bp was excised and the DNA 
purified from the gel fragment using Geneclean™ (Bio 
101, LaJolla, CA) . 

The isolated 400 bp PGR fragment was cloned 

3 0 into plasmid pCRII (Invitrogen Corporation, San Diego, 
CA) • Clones containing the 400 bp insert were subjected 
to DNA sequence analysis using the Sequenase® Version 
2 . 0 DNA Sequencing Kit . (United States Biochemical 
Corporation, Cleveland, OH) . A representative clone, 

35 designated pSS400, was used for subsequent analysis and 
manipulation. • 
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DNA sequence was determined by separation of 
the fragments on a polyacrylamide/urea gel (BioRad, 
Richmond, CA) , drying the gel on a sheet of Whatman 3MM 
paper (Whatman International Ltd. , Maidstone England) 
5 using a BioRad Model 583 gel dryer. The dried gel was 

exposed to X-ray film, and the DNA sequence was read. 
The sequence was examined for homology with the 
published squalene synthetase sequences. The deduced 
amino acid sequence encoded by the Nicotiana 

10 benthamiana squalene synthetase cDNA fragment was found 
to be 92% homologous to the conserved regions of amino 
acid sequence reported. See Robinson et al . . Molecular 
and Cellular Bioloov . Vol. 13, p. 2706 (May 1993). All 
non- identical amino acids in these regions were 

15 conserved substitutions. 

C. 3' RACE amplification of SG[ualene 
synthetase . 

Sequence data generated from the 4 00 bp 
internal squalene synthetase cDNA fragment were used to 

20 design additional primers to facilitate the isolation 
of the complete cDNA. These were synthesized on an 
Applied Biosystems Model 391 PGR Mate, or were obtained 
from commercial sources . 

Approximately 150 ng of total RNA was heated 

25 with 10 pg of Adapter Primer (Gibco BRL, 5'-GGC CAC GCG 

TCG ACT AGT AC(T)i7-3') for 10 min . at 65<>C and then 
chilled 2 min on ice. The reaction mixture was made up 
to 20 /il total volume with final concentrations of 20 
mM Tris-HCl (pH 8.4), 50 mM KCl, 2.5 mM MgClj, 100 fig/ml 

30 BSA, 10 mM DTT, 500 nM Adapter Primer and 0.5 mM each 

dATP, dCTP, dGTP and dTTP. The mixture was 
equilibrated 2 min at 42*^C. 200 units of Superscript™ 
reverse transcriptase were added and the mixture 
incxibated 30 min. at 42 ®C. Two units of E. coli RNase 

35 H was added and the mixture incubated 10 min at 42^C to 
yield the first strand cDNA. 
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The first strand cDNA was directly amplified 
by the PGR method. A 50 ^il reaction was assembled with 
a final composition of 20 mM Tris-HCl (pH 8.4), 50 mM 
KCl, 2.5 mM , 100 /ig/ml BSA, 200 nM primer SSP-3 (5'- 
5 TCC ATG GGT TTA TTT CTT CAG -3') 200 nM Universal 

Amplification Primer (Gibco BRL, 5'-CUA CUA CUA CUA GGC 
CAC GCG TCG ACT AGT AC -3'), 0 . 2 mM each dATP, dCTP, 
dGTP and dTTP, and 5 units AmpliTaq® DNA Polymerase 
(Perkin-Elmer Cetus) . Amplification was in a DNA 

10 Thermal Cycler with 1 cycle of 5 min. at 94^C, 35 

cycles of 2 min at 94**C, 2 min at 55*>C, 2 min at 72**C, 
and 1 cycle of 5 min. at 72**C. An aliquot of the 
amplified PCR products was used directly in a nested 
amplification performed in a 50 ^1 reaction with a 

15 final composition of 10 mM Tris-HCl (pH 8.3), 50 mM 

kCl, 1-5 mM , 0.001% (w/v) gelatin, 0.2 mM each dATP, 
dCTP, dGTP and dTTP, 200 nM Universal Amplification 
Primer, 200 nM primer SSP-4 (5'-CAA TGT CTA AAT GAC ATG 
GTC ACT AAT GC- 3'), 1 ftl PCR products, and 2.5 units 

20 AmpliTaq® DNA Polymerase. Amplification was performed 
in a DNA Thermal Cycler with 1 cycle of 5 min. at 94 ^'C, 
35 cycles of 2 min at 94**C, 2 min at 55**C, 2 min at 
72®C, and 1 cycle of 5 min. at 72<»C. 

D. Southern Analysis of PCR Products: 

25 Nested PCR 3' RACE products were analyzed by 

separation on a 1% SeaKem® agarose (FMC BioProducts, 
Rockland, ME) gel in TAE buffer. The PCR products in 
the gel were denatured by treatment for 20 min. in 1.5 
M NaCl, 0.5 N NaOH and neutralized by soaking in 

30 several volumes of 1.5 M NaCl, 1 M Tris-HCl (pH 8. 0) . 

The PCR products were transferred to a Flash® nylon 
membrane (Stratagene Cloning Systems, La Jolla, CA) 
using a PosiBlot® pressure blotter (Stratagene) and 
were UV-crossl inked to the membrane using a 

35 Stratalinker® 2400. 

Hybridization and detection of sgualene 
synthetase homologous sequences were performed using 
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the chemiluminescent Genius® system (Boehringer 
Mannheim Biochemicals, Indianapolis, IN) • The membrane 
was prehybridized for 1 hr. at 60**C in 10 ml 
hybridization solution (5X SSC, IX Blocking solution, 
5 0.1% N-laurylsarcosinate, 0.02% sodium dodecylsulf ate 
(SDS) ; lOX Blocking solution is 10%(w/v) Blocking 
Reagent (Boehringer Mannheim Biochemicals , 
Indianapolis, IN) in 100 mM maleic acid at pH 7.5, 150 

* / • AAxwM^n w WAAO wcxc: XXI cl I'iOQex lu4U — UX — U 

10 rotating bottle hybridization incubator (Robbins 

Scientific, Sunnyvale, CA) . Heat -denatured (5 min. at 
100 ^C) probe was added to the prehybridization mixture 
at a final concentration of 1.25 ng/ml, and incubation 
of the resulting mixture was continued 4-16 hr. at 

15 60<>C. The membrane was washed twice for 5 min. in 2X 
SSC, 0.1% SDS at room temperature, and twice for 15 
min. in O.IX SSC, 0.1% SDS at 60 ®C. For 
chemiluminescent detection, the membrane was rinsed in 
100 mM maleic acid, ISO mM NaCl, and incubated for 30 

20 min. at room temperature in IX Blocking solution. 

Alkaline phosphatase -conjugated anti-digoxigenin Fab 
fragments were added (1:10,000 dilution), and 
incubation was continued for 30 min. at room 
temperature. The membrane was washed twice for 15 min. 

25 in 100 mM maleic acid, 150 mM NaCl, at room 

temperature. The membrane was equilibrated for 2 min. 
in 100 mM Tris-HCl (pH 9.5) , 100 mM NaCl, 50 mM MgClj. 
The membrane was placed on a sheet of clear acetate 
film and Lumi-Phos® 530 (Boehringer Mannheim 

30 Biochemicals, Indianapolis, IN) was added on the DNA 
side of the membrane. Another sheet of acetate film 
was used to cover the membrane, and the membrane was 
exposed to X-ray film. Positive signals on the X-ray 
film indicated the presence of PCR products containing 

35 sgualene synthetase homologous sequence. 

Chemiluminescent probe for the hybridization 
procedure was generated by labeling the 400 bp insert 
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seguences of plasmid pSS400. One /ig of plasmid pSS400 
was digested with EcoRI to excise the insert. The 
sgualene synthetase cDNA fragment was separated on an 
agarose gel and excised after visualization by ethidium 
5 bromide staining. The DNA was purified from the gel 

fragment using Geneclean^ and resuspended in 14 /xl dHjO. 
The DNA was heat -denatured 10 min at 100«^C and chilled 
on ice. The Genius® Kit 2 (Boehringer Mannheim 
Biochemicals) was used to label the fragment with 
10 digoxigenin-dUTP according to the manufacturer's 

protocols. The probe concentration was determined by 
comparison with a dilution series of a known standard 
on a Flash® membrane, detected by chemiluminescence as 
above. 

15 E. Cloning of the 3' RACE products- 

The neisted PCR 3' RACE products of scpialene 
synthetase were cloned using the pCRII vector. Clones 
were screened by digestion with EcoRl to liberate the 
insert, separation on an agarose gel, transfer to a 

20 nylon membrane, and hybridization and detection as 
described above. Clones exhibiting a positive 
hybridization signal were subjected to DNA sequence 
analysis as described above. The sequence was examined 
for homology with the published squalene synthetase 

25 sequences. Clones were identified that exhibited at 
least 90% homology to the conserved regions in other 
squalene synthetase genes. 

F. 5' RACE amplification and cloning of 
squalene synthetase . 

30 The 5' RACE procedure (Frohman et al . , Proc. 

Nat. Acad- Sci . USA , Vol, 85, pp. 8998-9002 (1988)) was 
used to amplify the 5' terminus of squalene synthetase 
mRNA. 

Four to six ^9 total Nicotiana benthamiana 
35 RNA was combined with 10 pmole of primer SSP-5 (5^- 

GTA AGT CAA ACA ATC TTC TAC ATG TGA-3' ) and the mix was 
heated for 10 min at 70^C and chilled on ice. The 
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reaction was brought to a volume of 20 /il in 20 mM 
Tris-HCl (pH 8.4, 50 mM KCl, 2.5 mM MgClj, 0.1 mg/ml 
BSA. 10 mM DTT, 0.2 mM each dATP, dCTP, dGTP and dTTP, 
and 200 units reverse transcriptase. The mixture was 
5 incubated for 10 min at room temperature and 50 min at 
4 2«»C. The reaction was stopped by heating for 5 min at 
90<»C and chilled on ice. Two units E. coli RNAse H 
were added and the reaction incubated 20 min at 37<»C. 

= — w»i<n mao ^tirj.j.xeu i. X om tnc prevxous 

10 reaction using Geneclean", resuspended in 16 nl dH20, 
heated 10 min at 70«»C and chilled on ice. The tailing 
reaction was initiated by adding i ^1 of lOX reaction 
buffer (200 raM Tris-HCl, (pH 8.4), 500 mM KCl , 25 mM 
MgClj, 1 mg/ml BSA) , 2 m1 2 mM CTP, and 15 units of 
15 terminal transferase (TdT) to the single-stranded cDNA. 
After 10 min at 37«»C, the TdT was inactivated for 10 
min at 70»C and the reaction chilled on ice. 

Ten fil of the dC- tailed cDNA was used in the 
amplification step. The volume of the reaction was 
20 brought to 100 ^1 by the addition of 76.5 nl dHjO, 9.5 
fil lOX reaction buffer, 0.2 mM each dATP, dCTP, dGTP 
and dTTP, 10 pmole primer SSP-6 (5'-CTG AAG AAA TAA ACC 
CAT GGA -3'), 10 pmole Anchor primer (Gibco BRL, 5' -CUA 
CUA CUA CUA GGC CAC GCG TCG ACT AGT ACG GGI IGG GII GGG 
25 IIG-3') and 5 units Tag DNA polymerase (Gibco BRL) . 

The amplification was performed with one cycle of 2 min 
at 94»C, 37 cycles of 30 sec at 94«>C, 15 sec at 55«'C, 
40 sec at 72«>C, and one cycle at 72<»C for 5 min. The 
amplification products were analyzed by agarose gel 
30 electrophoresis and a DNA fragment of approximately 800 
bp was visualized after ethidium bromide staining. 

One microliter of the first reaction was 
amplified by PGR using the Universal Amplification 
Primer (UAP) (Gibco BRL, 5'- CUA CUA CUA CUA GGC CAC 
35 GCG TCG ACT AGT AC -3') . The reaction was composed of 
80.5 ^1 dHjO, 10 fil lOX Tag polymerase buffer (Gibco 
BRL) , 5 Ml 50 mM MgClj, 1 fil of the DNA Polymerization 
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Mix (Pharmacia LKB Biotechnology, Piscataway, NJ) , i fil 
of the first amplification reaction, and 2.5 units Taq 
polymerase. The amplification was performed with one 
cycle of 2 min at 94®C, 30 cycles of 40 sec at 94*»C, 30 
5 sec at 55**C, 1 min at 72**C, and one cycle at 72 ®C for 5 
min. The amplified products were purified using a 
Magic PCR Prep® column (Promega, Madison, WI) as 
recommended by the manufacturer followed by agarose gel 
electrophoresis. The resulting 800 bpDNA fragment was 
10 excised from the agarose and purified using Geneclean"". 

The fragment was cloned into a pCRII vector using 
standard cloning techniques. The clones were subjected 
to DNA seq[uence analysis using the Sequenase Version 
2.0 DNA Sequencing Kit (United States Biochemical 
15 Corporation, Cleveland, OH) . The sequence was examined 
for homology with the published squalene synthetase 
sequences. Clones were detected that exhibited at 
least 90% homology to the conserved regions in other 
scpialene synthetase genes. 
20 G. Cloning of Squalene Synthetase. 

A region of the squalene synthetase 
encompassing the start codon and the stop codon was 
amplified using the PCR as follows: Total RNA from 500 
mg of fresh leaves of Nicotiana benthamistna was 
25 extracted with Trizol™ RNA extraction kit (Gibco BRL) . 
The first DNA strand was synthesized by a reverse 
transcription reaction using 6 /ig of total RNA as a 
template, oligo (dT) ^j.^^ as primer {0.5 iiq) , and the 
Superscript™ preamplif ication kit (Gibco BRL) . The 
30 RNA-primer mix was brought to a total volume of 14 ^1, 
heated for 10 min at TO^'C, then chilled on ice. - The 
reaction volume was adjusted to 20 ^1 by adding 2^1 of 
lOX synthesis buffer (200 mM Tris-HCl (pH 8.4), 500 mM 
KCl, 25 mM MgCl^, 1 mg/ml BSA) , 2/il of 0 . 1 M DTT, 1 ^1 
35 of 10 mM dNTP mix and 1 iil (200 units) ,of reverse 

transcriptase. The mixture was incubated for 10 min at 
room temperature and 50 min at 42**C. The reaction was 
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stopped by heating 5 min at 90«C and chilled on ice. 
Then, 1 fil of E. coli RNAse H (two units) was added and 
the mixture incubated 20 min at 37«*C. 

For the polymerase chain reaction (PGR) step, 
5 two primers (10 pmole of each) were added (SSP-7 (5'- 
CAA CGC TTT CTC CTC TTA CCA -3') and SSP-8 (5'-CGT CTA 
AGA TCG GTT TCC GGA -3')) and the reaction mix volume 
increased to 100 ^1 with 8 /il of 10 X synthesis buffer, 
0.5 111 of Tag DNA polymerase and water. PCR was 
performed using 94°C for 40 sec, 55°C for 20 sec and 
72**C for 1 min for 35 cycles, and 94 ^C for 40 sec, 55 ^C 
for 20 sec and 72«C for 6 min for one cycle. 

The PCR products were analyzed by agarose gel 
electrophoresis. A DNA fragment of the expected size 
of 1275 bp was excised from the agarose gel, and 
purified using Geneclean'". The PCR product was then 
ligated to the pCR II using the manufacturer's 
recommendations. Following transformation, fourteen 
colonies were analyzed by plasmid DNA isolation and 
20 digestion with Eco RI . Plasmid DNA from three 

independant clones containing the expected 1275 bp 
insert were purified using the Wizard midiprep 
purification kit (Promega Corp.) and the sequences 
determined. A single example (designated pBGC802, 
25 Figure 1) was chosen and used as a template for 
specific amplification of the coding region. 

Amplification of the squalene synthetase 
coding region was carried out as follows: 
Plasmid pBGC802 (10 ng) was used as a template in 
30 amplification by PCR using primers SSP-9 (5'-GCG CAT 

ATG GGG AGT TTG AGG GCT ATT Crr-3') and SSP-10 (5'- CGA 
GTC GAC TAA GAT CGG TTT CCG GAT AG -3') . Primer SSP-9 
was homologous to squalene synthetase gene sequences 
containing the translational initiation codon. 
35 Additionally, several nucleotides were changed to 
introduce an Nde 1 restriction site in the primer. 
Primer SSP-10 was complementary to sqrualene synthetase 
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gene sequences containing the translational termination 
codon with several nucleotides changed to create a Sal 
I restriction site downstream from the stop codon. 

The amplification reactions consisted of 80.5 
5 Ml of water, 10 /xl of lOX Taq DNA polymerase buffer 
(Gibco BRL) , 5 fil of MgCla 50 mM, 1 fxl (10 pmole) of 
each primer SSP-9 and SSP-10, 1 /xl of the DNA 
Polymerization Mix, 1 fil of the uncut plasmid DNA 
(pBSG802 at 10 ng//xl) and 0 . 5 /il (2 .5 units) of Tag DNA 

10 Polymerase (Gibco BRL) . The amplifications were 

performed with 4 min incubation at at 94 ®C; 2 cycles of 
40 sec at 94**C, 20 sec at 45<*C, 1 min at 72*^C; 20 
cycles of 40 sec at 94®C, 20 sec at 55®C, 1 min at 
72®C; and 5 min incubation at 72®C. 

15 The amplified products were analyzed by 

agarose gel electrophoresis and the single DNA fragment 
of 1240 bp was purified using Geneclean®. The PGR 
products were ligated to the pCR II vector using the 
directions of the manufacturer. The ligation products 

20 were used to transform E. coll competent cells 

(OneShot™ INVofF' competent cells, Invitrogn Corp. 
Analysis of plasmid DNA from the resulting bacterial 
colonies revealed an insert of approximately 124 5 bp. 
Plasmid DNA was purified using the Wizard midiprep 

25 purification kit and the DNA sequence determined. The 
resulting clone, designated pBGC803 (Figure 2) , thus 
consisted primarily of the Nicotiana benthamiana 
sgualene synthetase coding region. 

Cloning of the sgualene synthetase gene in a 

30 bacterial expression vector was carried out as follows: 

Plasmid pBGC803 was cut with Sal I and with 
Nde I in order to excise the squalene synthetase coding 
region. The digestion products were analyzed by 
agarose gel electrophoresis and the 1240 bp DNA 

35 fragment purified using Geneclean". Plasmid pT7-7 

(Tabor et al . , Proc . Nat . Acad . Sci , USA . Vol. 82, p. 
1074 (1985) ) was cut with Nde I and Sal I and the 
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squalene synthetase coding region was inserted followed 
by transformation of DHIOB competent cells (Gibco BRL) . 
One of the resulting clones, designated pBGC804 
(Figure 3), contains the Nde 1/Sal 1 DNA fragment of 
5 clone pBGC803, 

Competent cells of bacterial strain BL21 
(DEB) (Novagen, Inc., Madison, WI) were transformed 
with plasmid pBGC804. A culture was started from a 

well isolat'f»H r?ol OTTV T fn^Hlia /-'r\nt- -i r» -i r-*/^ /r«T ^-P 

■ =* ==» — W.- ... ««k w WAA 1,. ^ W / W X. 

10 ampicillin. When the ODgoo was approximately equal to 

0.4, IPTG was added to a final concentration of 0.4 mM. 
The culture was maintained at 37<>C with agitation. At 
0, 10 min, 30 min, 60 min, 90 min, 120 min and 240 min 
post -induction with IPTG, aliquots of 1.5 mL were taken 

15 and the cells pelleted at 14,000 rpm for 3 min. The 
cell pellet was then resuspended with 100 /il of IX 
loading buffer and 10 /xl of each aliquot was analyzed 
for total protein content using a 12% SDS- 
polyacrylamide precast gel (Novex, San Diego, CA) . A 

20 protein of approximately 45 kDa increased in intensity 
from time 0 to time 240 min post -induction with IPTG. 
. The calculated molecular weight for the squalene 
synthetase protein is 47,073. 
Example 2 

25 Insertion of a 700 bp antisense Nicotiana 

benthamiana squalene synthetase sequence into a virus - 
based gene expression system and expression in 
transfected plants was accomplished as follows: 

A. Insertion of the squalene synthetase gene 

30 into an RNA viral vector: RNA viral vector TTOIA was 

constructed as follows: 

An 861 bp fragment from tomato mosaic virus 
virion RNA (fruit necrotic strain F; ToMV-F) was 
isolated by RT-PCR using ToMV primers 5'-CTC GCA AAG 

35 TTT GGA ACC AAA TCC TC-3' (upstream) and S'-CGG GGT ACC 

TGG GCC CCA ACC GGG GGT TCC GGG GG-3' (downstream) and 
subcloned into the Hindi site of pBluescript KS- 
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(Stratagene, LaJolla, CA) . A hybrid virus consisting 
of TMV-Ul and ToMV-F was constructed by swapping an 874 
bp BaiaHI/Kpnl ToMV fragment into pB6C152 . See, Kumagai 
et al , . Proc, Nat. Acad. Sci, USA . Vol. 89, p. 427 
5 (1993), creating plasmid pTTOl. A unique AvxJJ site 

was inserted downstream of the XhoJ site in TTOl by PGR 
mutagenesis, creating plasmid TTOIA, using the primers 
5' -TCC TCG AGC TAG GCT CGC AAA GTT TCG T^C CAA ATC CTC 
A- 3' (upstream) and 5'-CGG GGT ACT GGG CCC CAA CCG GGG 

10 GTT CCG GGG G-3' (downstream) . 

The squalene synthetase 3 ' RACE product was 
cloned into plasmid pCRII . Unique Xhol and Avrll sites 
were inserted into a 642 bp antisense fragment by PCR 
mutagenesis. A 50 fil reaction was assembled with a 

15 final composition of 20 mM Tris-HCl (pH 8.4), 50 mM 

KCl, 2.5 mM MgClj, 0.01% (w/v) gelatin, 0.5 mM each of 
dATP, dCTP, dGTP.and dTTP, 200 nM primer SSP-11 (5' - 
GCG GCT CGA GGA CTC TAC AAC AC-3') , 200 nM primer SSP- 
12 (5'- CGG CCT AGG ACA TGT CTG CTT TGC-3'), and 5 

20 units AmpliTaq® DNA polymerase. The amplification was 
perforTfied with one cycle of 2 min at 94 ®C, 30 cycles of 
40 sec at 94**C, 30 sec at 55**C, 1 min at 72**C, and one 
cycle at 72 ^C for 5 min. The amplified products were 
analyzed by separation on a 1% SeaKem® agarose gel in 

25 TAE by gel electrophoresis and a band corresponding to 

the predicted size was excised and the DNA purified 
using Geneclean™. The isolated antisense squalene 
synthetase fragment was cloned into the Xhol and Avrll 
sites of TTOIA to create plasmid pBGC805 (Figure 4) . 

30 Plasmid pBGC805, was linearized with Kpnl and 

transcripts were synthesized in vitro using a 
MEGAscript SP6 Kit and m7G (5' )ppp (5 ' ) G (Ambion, Austin, 
TX) . N. icotiana benthamiana plants were inoculated 
with in vitro transcripts and maintained in a growth 

35 chamber at 25 °C and a 12 hr light /dark regime for 13 
days. At 6 and 13 days post -inoculation, infected 
leaves were harvested and analyzed for squalene 
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synthetase activity. Isolation of total plant 
membranes and assay for SG[ualene synthetase enzyme 
activity were performed as previously described. See 
Hanley e£_al. , Plant Phvsiol . . Vol. 98, p. 215 (1992). 
By 13 days post -inoculation scjualene synthetase 
activity in the plants transfected with the anti- 
squalene synthetase construct was undetectable. 
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SBQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Hanley, Kathleen M. 

Nicolas, Olivier 
Hellmann, Gary M. 

(ii) TITLE OF INVENTION: DNA Sequence encoding squalene 
synthetase 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: R. J. Reynolds Tobacco Company 

(B) STREET: 950 Reynolds Blvd., P. O. Box 1487 

(C) CITY: Wins ton- Sal em 

(D) STATE: North Carolina 

(E) COUXOTRY: U. S. A. 
<F) ZIP: 27102-1487 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Borschke, August J. 

(B) REGISTRATION NUMBER: 30,539 

(C) REFERENCE/DOCKET NUMBER: CC-212A 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 910-741-5491 
<B) TEI^FAX: 910-741-5449 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 1642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
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<vi) ORIGIKAL SOURCE: 

(A) ORGANISM: Hicotiana benthamiana 
(F) TISSUE TYPE: Leaf 

(XX) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 112 .. 1344 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ATCATCGTTT CCACGCAATT CATCGTTCAA CGACTTCTTC AAATACAACC ATTTTGCCAA 60 

CGCTTTCTCC TCTTACCAAA CAAACAACCC CCCACCAAAG GCTGAATAAG A ATG GGG 117 

Met Gly 
1 

AGT TTG AGG GCT ATT CTG AAG AAT CCA GAG GAT TTA TAT CCA TTG GTG 165 
Ser Leu Arg Ala lie Leu Lys Asn Pro Glu Asp Leu Tyr Pro Leu Val 
5 10 15 

AAG CTG AAG CTA GCG GCT CGA CAC GCG GAG AAG CAG ATC CCG CCG TCT 213 
Lys Leu Lys Leu Ala Ala Arg His Ala Glu Lys Gin lie Pro Pro Ser 
20 25 30 

CCA AAT TGG GGC TTC TGT TAC TCA ATG CTT CAT AAG GTT TCT CGT AGC 261 
Pro Asn Trp Gly Phe Cys Tyr Ser Met Leu His Lys Val Ser Arg Ser 
35 40 45 50 

TTT GCT CTC GTC ATT CAA CAA CTT CCA GTC GAG CTT CGT GAC GCC GTG 309 
Phe Ala Leu Val lie Gin Gin Leu Pro Val Glu Leu Arg Asp Ala Val 
55 60 65 

TGC ATT TTC TAT TTG GTT CTT CGA GCA CTT GAC ACT GTT-GAG GAT GAT 357 
Cys He Phe Tyr I^u Val Leu Arg Ala Leu Asp Thr Val Glu Asp Aap 
70 75 80 

ACC AGC ATT CCC ACC GAT GTT AAA GTT CCT ATT CTG ATC TCT TTT CAT 405 
Thr Ser He Pro Thr Asp Val Lys Val Pro He I-eu He Ser Phe His 
85 90 95 

CAG CAT GTT TAT GAT CGC GAA TGG CAT TTT TCA TGT GGT ACA AAG GAG 453 
Gin His Val Tyr Asp Arg Glu Trp His Phe Ser Cys Gly Thr Lys Glu 
100 105 110 

TAC AAG GTT CTC ATG GAC CAG TTC CAT CAT GTA TCA ACT GCT TTT CTG 501 
Tyr Lys Val Leu Met Asp Gin Phe His His Val Ser Thr Ala Phe Leu 
lis 120 125 130 

GAG CTT AGG AAA CAT TAT CAG CAG GCA ATT GAG GAT ATT ACC ATG AGG 549 
Glu Leu Arg Lys His Tyr Gin Gin Ala He Glu Asp He Thr Met Arg 
135 140 145 

ATG GGT GCA GGA ATG GCA AAA TTC ATA TGC AAG GAG GTG GAA ACA ACC 597 
Met Gly Ala Gly Met Ala Lys Phe He Cys Lys Glu Val Glu Thr Thr 
150 155 160 
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GAT GAT TAT GAC GAA TAT TGT CAC TAT GTA GCT GGG CTT GTT GGG CTA 645 
Asp Asp Tyx Asp Glu Tyr Cys His Tyr Val Ala Gly Xjeu Val Gly Leu 
165 170 175 

GGA TTG TCA AAA CTG TTC CAT GCC TCT GAG AAA GAA GAT CTG GCT TCA 693 
Gly Leu Ser Lys Leu Phe His Ala Ser Glu Lys Glu Asp Leu Ala Ser 
180 185 190 

GAT TCT CTC TCC AAC TCC ATG GGT TTA TTT CTT CAG AAA ACA AAC ATC 741 
Asp Ser Leu Ser Asn Ser Met Gly Leu Phe Leu Gin Lys Thr Asn lie 
195 200 205 210 

ATT AGA GAT TAT TTG GAA GAC ATA AAT GAA GTA CCC AAG TGC C6T ATG 789 
lie Arg Asp Tyr Leu Glu Asp lie Asn Glu Val Pro Lys Cys Arg Met 
215 220 225 

TTC TGG CCC CGT GAA ATA TGG AGT AAA TAT GTT AAC AAG CTT GAG GAA 837 
Phe Trp Pro Arg Glu lie Trp Ser Lys Tyr Val Asn Lys Leu Glu Glu 
230 235 240 

TTA AAG TAC GAG GAT AAC TCG GCC AAA GCA GTG CAA TGT CTA AAT GAC 885 
Leu Lys Tyr Glu Asp Asn iser Ala Lys Ala Val Gin Cys Leu Asn Asp 
245 250 255 

ATG GTC ACT AAT GCT TTA TCA CAT GTA GAA GAT TGT TTG ACT TAC ATG 933 
Met Val Thr Asn Ala Leu Ser His Val Glu Asp Cys Leu Thr Tyr Met 
260 265 270 

TCT GCT TTG CGT GAT CCT TCC ATC TTT CGA TTC TGT GCT ATT CCA CAG 981 
Ser Ala Leu Arg Asp Pro Ser lie Phe Arg Phe Cys Ala lie Pro Gin 
275 280 285 290 

GTC ATG GCA ATT GGG ACA TTA GCT AT G TGC TAC GAC AAC ATT GAA GTC 1029 
Val Met Ala He Gly Thr Leu Ala Met Cys Tyr Asp Asn He Glu Val 
295 300 305 

TTC AGA GGA GTG GTA AAA ATG AGA CGT GGT CTG ACT GCT AAG GTC ATT 1077 
Phe Arg Gly Val Val Lys Met Arg Arg Gly Leu Thr Ala Lys Val He 
310 315 320 

GAC CGG ACC AGG ACT ATT GCA GAT GTA TAT GGT GCT TTT TTT GAC TTT 1125 
Asp Arg Thr Arg Thr He Ala Asp Val Tyr Gly Ala Phe Phe Asp Phe 
325 330 335 

TCT TGT ATG CTG AAA TCC AAG GTT AAT AAT AAT GAT CCA AAT GCA ACA 1173 
Ser Cys Met Leu Lys Ser Lys Val Asn Asn Asn Asp Pro Asn Ala Thr 
340 345 350 

AAA ACT CTG AAG AGG CTC GAA GTG ATC CTG AAA ACT TGC AGA GAT TCG 1221 
Lys Thr Leu Lys Arg Leu Glu Val He Leu Lys Thr Cys Arg Asp Ser 
355 360 365 370 

GGA ACC TTG AAC AAA AGG AAA TCC TAC ATA ATC AGG AGC GAG CCT AAT 1269 
Gly Thr Leu Asn Lys Arg Lys Ser Tyr He He Arg Ser Glu Pro Asn 
375 380 385 
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TAC AGT CCA GTT CTG ATT GTT GTC ATT TTC ATC ATA CTG GCT ATT ATT 1317 
Tyr Ser Pro Val Leu He Val Val He Phe He He Leu Ala He He 
390 395 400 

CTC OCA CAG CTA TCC GGA AAC CGA TCT TAGACGATAT TTTGGGTTAC 1364 
Leu Ala Gin Leu Ser Gly Asn Arg Ser 
405 410 

AAAAAAGAAG TCTGGTCAAG GAAGACAGCA GAAGCTCTTG GCCAATTGTG TGATTAGTGC 1424 

AGATTTTGAT GTTTGTAATT CTATCGTCCA TTAAGTGATA GTTGCACCTT TTAACCTGAC 1484 

AAGATAATTA CGAAGACCTA TTTTTGGTGG TTTGTTGGGT ATGTACTTGT TTGCAAGGCT 1544 

ACGTAAGCAA ATTCCAAGTG TT6TAGAGTC ACCGTGATGT AATAAACATG TCTTTTATTA 1604 

TAGTTTGTTC ATTTTTGTGG TAAAAAAAAA AAAAAAAA 1642 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Gly Ser Leu Arg Ala He Leu Lys Asn Pro Glu Asp Leu Tyr Pro 
15 10 15 

X«eu Val Lys Leu Lys Leu Ala Ala Arg His Ala Glu Lys Gin He Pro 
20 25 30 

Pro Ser Pro Asn Trp Gly Phe Cys Tyr Ser Met Leu His Lys Val Ser 
35 40 45 

Arg Ser Phe Ala Leu Val He Gin Gin Leu Pro Val Glu Leu Arg Asp 
50 55 60 

Ala Val Cys He Phe Tyr Leu Val Leu Arg Ala Leu Asp Thr Val Glu 
65 70 75 80 

Asp Asp Thr Ser He Pro Thr Asp Val Lys Val Pro He Leu He Ser 
85 90 95 

Phe His Gin His Val Tyr Asp Arg Glu Trp His Phe Ser Cys Gly Thr 
100 105 110 

Lys Glu Tyr Lys Val Leu Met Asp Gin Phe His His Val Ser Thr Ala 
115 120 125 

Phe Leu Glu Leu Arg Lys His Tyr Gin Gin Ala He Glu Asp He Thr 
130 135 140 
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Met Arg Met Gly Ala Gly Met Ala Lys Phe lie Cys Lys Glu Val Glu 

145 150 155 160 

Thr Thr Asp Asp Tyr Asp Glu Tyr Cys His Tyr Val Ala Gly Leu Val 
165 170 175 

Gly Leu Gly Leu Ser Lys Leu Phe His Ala Ser Glu Lys Glu Asp I«eu 
180 185 190 

Ala Ser Asp Ser Leu Ser Asn Ser Met Gly Leu Phe Leu Gin Lys Thr 
195 200 205 

Asn lie lie Arg Asp Tyr Leu Glu Asp lie Asn Glu Val Pro Lys Cys 
210 215 220 

Arg Met Phe Trp Pro Arg Glu lie Trp Ser Lys Tyr Val Asn Lys Leu 
225 230 235 240 

Glu Glu Leu Lys Tyr Glu Asp Asn Ser Ala Lys Ala Val Gin Cys Leu 
245 250 255 

Asn Asp Met Val Thr Asn Ala Leu Ser His Val Glu Asp Cys Leu Thr 
260 265 270 

Tyr Met Ser Ala Leu Arg Asp Pro Ser lie Phe Arg Phe Cys Ala He 
275 280 . 285 

Pro Gin Val Met Ala lie Gly Thr Leu Ala Met Cys Tyr Asp Asn He 
290 295 300 

Glu Val Phe Arg Gly Val Val Lys Met Arg Arg Gly Leu Thr Ala Lys 
305 310 315 320 

Val He Asp Arg Thr Arg Thr He Ala Asp Val Tyr Gly Ala Phe Phe 
325 330 335 

Asp Phe Ser Cys Met Leu Lys Ser Lys Val Asn Asn Asn Asp Pro Asn 
340 345 350 

Ala Thr Lys Thr Leu Lys Arg Leu Glu Val He Leu Lys Thr Cys Arg 
355 360 365 

Asp Ser Gly Thr Leu Asn Lys Arg Lys Ser Tyr He He Arg Ser Glu 

370 375 380 

Pro Asn Tyr Ser Pro Val Leu He Val Val He Phe He He Leu Ala 
385 390 395 400 

He He Leu Ala Gin Leu Ser Gly Asn Arg Ser 
405 410 
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THAT WBXCH XS CLAIMED IS*: 

1. An isolated nucleotide molecule encoding a 
polypeptide which has enzymatic activity for producing 
sgualene, which nucleotide molecule is isolated from a 
Nicotiana species. 

5 

2. An isolated nucleotide molecule encoding a 
polypeptide which has enzymatic activity for producing 
squalene, which nucleotide molecule encodes a 
polypeptide having an amino acid secpience specified by 

10 SEQ ID NO: 2. 

3 . An isolated nucleotide molecule encoding 
sgualene synthetase selected from the group consisting 
of: 

15 (a) isolated nucleotide molecules consisting 

essentially of the sequence specified by SEQ ID NO:l; 

(b) isolated nucleotide molecules which 
hybridize to an isolated nucleotide molecule of (a) 
above under conditions represented by a wash stringency 

20 of 0.3M NaCl, 0 . 03M sodium citrate, and 0*1% SDS at 

60*^C, which are at least about 95% homologous to 
isolated nucleotide molecules of (a) above and which 
encode squalene synthetase; and 

(c) isolated nucleotide molecules which 
25 differ in sequence from the isolated nucleotide 

molecules of (a) and (b) above due to the degeneracy of 
the genetic code, and which encode squalene synthetase. 

4. The nucleotide molecule of Claim 1 in the 
30 form of DNA. 

5. The nucleotide molecule of Claim 2 in the 
form of DNA. 
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6 . The nucleotide molecule of Claim 3 in the 
form of DNA. 

7. The nucleotide molecule of Claim 5 
5 isolated from a Nicotians species. 

8. The nucleotide molecule of Claim 6 
isolated from a Nicotxana. species. 

10 9. The nucleotide molecule of Claim 4 

isolated from Nicotlana benthamiana. 

10. The nucleotide molecule of Claim 5 
isolated from Nicotiana benthamiema , 



15 



11. The nucleotide molecule of Claim 6 
isolated from Nicotiana benthamiana . 
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